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Abstract. We introduce a minimal extended evolving model for small- world 
networks which is controlled by a parameter. In this model the network growth 
is determined by the attachment of new nodes to already existing nodes that 
are geographically close. We analyze several topological properties for our model 
both analytically and by numerical simulations. The resulting network shows some 
important characteristics of real-life networks such as the small- world effect and a high 
clustering. 
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1. Introduction 

Many real-life systems display both a high degree of local clustering and the small- world 
effect [H |21 El El- Local clustering characterizes the tendency of groups of nodes to be 
all connected to each other, while the small-world effect describes the property that any 
two nodes in the system can be connected by relatively short paths. Networks with 
these two characteristics are called small-world networks. 

In the last few years, a number of models have been proposed to describe real- 
life systems with small-world effect. The first and the most widely-studied model 
is the simple and attractive small-world network model of Watts and Strogatz (WS 
model) which triggered a sharp interest in the studies of the different properties of 
small- world networks PU El H] • Barthelemy and Amaral studied the origins of the 
small- world behavior in Ref. [Hj. Barrat and Weigt addressed analytically as well as 
numerically the structure properties of the WS model [7|. Amaral et al. investigated 
the statistical characteristics of a variety of diverse real-life networks [Bj. Latora and 
Marchiori introduced the concept of efficiency of a network and found that small-world 
networks are both globally and locally efficient [Oj. In Refs. [THl El El E2] , the spread 
and percolation properties were investigated, dealing with the spread of information 
and disease along the shortest path in the graph or the spread along the spanning 
tree. Recently, researchers have also focused their attention on other different aspects, 
characterizing many properties of small- world networks El El El El El 120] • 

In addition to the above-mentioned aspects, variations of the WS model are 
another focus of recent interest. Of these variants, a model proposed independently 
by Monasson [21] and by Newman and Watts ^T] , has been thoroughly studied [221 E3] ■ 
In 1999, Kasturirangan presented an alternative version to the WS model [21], a special 
case of which is exactly solvable [23] . One year later, Kleinberg provided a generalization 
of the WS model which is based on a two-dimensional lattice [2E1I2I]- The above models 
are all random. In fact, small- world networks can be also created by deterministic 
techniques such as modifications of some regular graphs [2Ej, addition and product of 
graphs [23- During the past few years, networks generated in deterministic ways have 
been also intensively studied 001 Ell E21 ESI E31 ESI EZl EH1 E0^ 

All the above models may partially mimic aspects of real-life small-world networks. 
Furthermore, these models are probably reasonable illustrations of how some networks 
are shaped. However, the small-world effect is much more general, and it is of interest 
to investigate other mechanisms producing small-world networks. Recently, Ozik, Hunt 
and Ott have introduced a simple evolution model (OHO model) of growing small- world 
networks with geographical attachment preference, in which all connections are made 
locally to geographically nearby sites [H]. Zhang, Rong and Guo have presented a 
deterministic small- world model (ZRG model) created by edge iterations [12], which 
is a deterministic version of a special case of the OHO model and a variant of the 
pseudofractal scale- free network |82j . 

The OHO model and ZRG model may provide valuable insights into some existing 
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real-world systems. It is then a natural question whether there is an encompassing 
scheme, which can put these two specific models into a more general perspective. In 
this paper, we propose a general scenario for constructing evolving small- world networks. 
Similar to the OHO and ZRG models, in our model, when a new node is added to the 
network, it is only connected to those preexisting nodes that are geographically close to 
it. Our model results in an exponential degree distribution, a large clustering coefficient 
and small average path length (APL), with values close to those known for many random 
small- world networks HU 123 123 123 123 123 HJj- Interestingly, our model includes a 
parameter q which controls part of the structural properties of the evolving small- world 
networks. Moreover, by tuning this parameter, one can obtain the OHO model and the 
ZRG model as particular cases of our model. 

The rest of this paper is organized as follows. Section 2 provides a detailed 
description of the construction for this evolving small-world network model. In Section 
3, we give analytical and simulation results of the main network properties: Degree 
distribution, clustering coefficient and average path length. The final section provides 
some conclusions. 

2. Evolving small- world network model 

In this section we describe a model of growing network, which is constructed in an 
iterative manner. We denote our network after t time steps by N(t). Then the network 
is constructed in the following way. We start from an initial state (t — 0) of m + 1 
(m even) nodes distributed on a ring all of which are connected to one another. For 
t > 1, N(t) is obtained from N(t — 1) as follows: For each internode interval along the 
ring of N(t — 1), with probability q, a new node is created and connected its m nearest 
neighbors (y on either side) previously existing at step t — 1. Distance, in this case, 
refers to the number of intervals along the ring. The growing process is repeated until 
the network reaches the desired size. Figure 1 shows the network growing process for a 
special case of m = 2 and q = 1. 

When q = 1 and m = 2, the network is reduced to the deterministic ZRG model |42j . 
If q < 1, the network is growing randomly. Especially, as q approaches to zero (without 
reaching this value) the model coincides with the OHO model where at each time 
step, only one interval is chosen and linked to its m nearest neighbors, with every interval 
having the same probability of being selected (see j3H| for interpretation). Varying q in 
the interval (0,1) allows one to study the crossover between the OHO model [H] and 
the ZRG model [32] • It should be mentioned that as q is a real number, below we will 
assume that all variables concerned with q change continuously. Notice that similar 
presumption has been used in Refs. [H 121 IS], which is valid in the limit of large t. 

Now we compute the number of nodes and edges of N(t). We denote the number of 
newly added nodes and edges at step t by L v (t) and L e (t), respectively. Thus, initially 
(t = 0), we have L v (0) — m + 1 nodes and L e (0) = m{m + l)/2 edges in iV(0). Let 
N c (t) denote the total number of internode intervals along the ring at step t, then 
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Figure 1. Illustration of the growing small- world network for m = 2 and q = 1, 
showing the first three steps of the iterative process. 



N c (0) — m+ 1. By construction, we have L v (t) = N c (t — l)q for arbitrary t > 1. Note 
that, when a new node is added to the network, an interval is destroyed and replaced 
by two new intervals, hence the number of total intervals increases by one. Thus, we 
have the following relation: N c (t) = N c (t — 1) + L v (t). On the other hand, the addition 
of each new node leads to m new edges, after simple calculations one can obtain that at 
U {U > 1), L v (ti) = (m + 1)(1 + q) ti ^ 1 q and L e (U) = m(m + 1)(1 + q) u ~ l q, respectively. 
Therefore, the number of nodes N t and the total of edges E t of N(t) is 

N t = Y i L v (t j ) = (m + !)(! + qf (1) 



and 



E t = J2L e (t J )=m(m+r 



respectively. The average node degree is then 



< k >i 



2^ 



2m 



(2) 



(3) 



2(1 + q y 

For large t and any q ^ 0, it is small and approximately equal to 2m. Notice that many 
real-life networks are sparse in the sense that the number of edges in the network is 
much less than N t (N t — l)/2, the number of all possible edges [UEIIS]- 



3. Structural properties of the evolving small-world Network 

Structural properties of the networks are of fundamental significance to understand the 
complex dynamics of real-life systems. Here we focus on four important characteristics: 
degree distribution, clustering coefficient, average path length and diameter. 
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3.1. Degree distribution 

Degree is the simplest and most intensively studied characteristic of an individual node. 
The degree of a node % is the number of edges in the whole network connected to %. The 
degree distribution P(k) is defined as the probability that a randomly selected node has 
exactly k edges. Let kiit) denote the degree of node i at step t. If node i is added to 
the network at step tj then, by construction, fcj(tj) = m. In each of the subsequent time 
steps, there are m intervals with ^ at each side of i. Each of these intervals could be 
considered, with probability q, to create a new node connected to i. Then the degree 
ki(t) of node i satisfies the relation 

fci(t) = ki(t- 1) + mq (4) 

considering the initial condition ki{tj) = m, we obtain 

ki(t) = m + mq(t - U) (5) 

The degree of each node can be obtained explicitly as in Eq. and we see that this degree 
increases at each iteration. So it is convenient to obtain the cumulative distribution j3] 

oo 

Pcum(k) = E P ( k ') (6) 
k'=k 

which is the probability that the degree is greater than or equal to k. An important 
advantage of the cumulative distribution is that it can reduce the noise in the tail 
of probability distribution. Moreover, for some networks whose degree distributions 
have exponential tails: P(k) ~ e~ fc / K , cumulative distribution also gives exponential 
expression with the same exponent: 

oo oo 

Pcumik) = £ P(k') ~ J2 e ~ k ' /K ~ e ~' k/K (7) 
k'=k k'=k 

This makes exponential distributions particularly easy to spot experimentally, by 
plotting the corresponding cumulative distributions on semilogarithmic scales. 

Using Eq. we have P cum (k) = ££? =fc P(k) = p(t> <r = t- (^)). Hence 

P m - V h&l = m + 1 4. V (m + lXl + g)^ 
n U mW ^ Nt {m + m + q)t + ^ {m+1){1 + q y 

k — m 

= (i + q y— (8) 

The cumulative distribution decays exponentially with k. Thus the resulting network is 
an exponential network. Note that most small-world networks including the WS model 
belong to this class [Zj. 

In Fig. (J2J, we report the simulation results of the cumulative degree distribution 
for several values of q and with m = 2. Except in the deterministic case 5 = 1, the degree 
spectrum of the networks is continuous. From Fig. (|2]L we can see that the cumulative 
degree distribution decays exponentially for large degree values, in agreement with the 
analytical results and supporting a relatively homogeneous topology similar to most 
small- world networks [g[nj|22l21l23llHI2IllII!- Other values of m should give 
qualitatively a similar behavior as for m = 2. 
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Figure 2. Semilogarithmic graph of the cumulative degree distribution for the evolving 
networks in the case of to = 2 and for different values of q. All data points are obtained 
by averaging ten independent simulations. 



3.2. Clustering coefficient 

Most real-life networks show a cluster structure which can be quantified by the clustering 
coefficient El H] ■ The clustering of a node gives the relation of connections of the 
neighborhood nodes closest to it. By definition, the clustering of a node i with ki 
adjacent nodes is given by C{ = 2ei/[ki(kt — 1)], where e« is the number of existing 
edges between its neighbors. The clustering coefficient C of a network is obtained by 
averaging Cj over all the vertices in the network. 

For the particular case m = 2, using the connection rules, it is straightforward to 
calculate exactly the clustering coefficient of an arbitrary node and the average value 
for the network. When a node % enters the network, ki and e% are 2 and 1, respectively. 
After that, if the degree ki increases by one, then its new neighbor must connect one of 
its existing neighbors, i.e. increases by one at the same time. Therefore, is equal 
to ki — 1 for all vertices at all time steps. So there exists a one-to-one correspondence 
between the degree of a node and its clustering. For a node v with degree k, the 
exact expression for its clustering coefficient is 2/k, which has been also been obtained 
in Ref. pl2*] 132"] 133] . This expression for the local clustering shows the same inverse 
proportionality with the degree than the observed in a variety of real-life networks [34J. 

In addition to the good scaling of the clustering coefficient for single node, the 
average clustering coefficient C of the network is very high. Also, C depends on q 
and approaches to a constant asymptotic value as the network order is very large. In 
Fig. (JHJ), we show C as a function of q in the case of m = 2. From Fig. (JHJ), one can 
see in the infinite order limit of the network, that C approaches to a nonzero constant 
value. Simulations exhibit that C equals to 0.6482, 0.6560, 0.6640, 0.6729 and 0.6828 for 
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Figure 3. Average clustering coefficient C vs q when m — 2. Each data point is an 
average over ten independent simulation runs. 

q = 0.005, 0.2, 0.4, 0.6, and 0.8, respectively. Fig. (J3J) reflects the dependence of C, the 
clustering coefficient of the network, on q. It is obvious that C increases continuously 
with q. As q increases from to 1, C grows from |ln3 — 1 |H] to In 2 02], i>e. from 
0.6479 to 0.6931. The reason for this dependence relation would need further study, but 
might be related to a biased choice of the edges chosen at each iteration, see Ref. |45j . 
Although we only focus on the case m — 2, one expects that for other values of m, C 
also will converge to a different nonzero value for every different value of q (see Ref. jH] 
for a particular case). 

3.3. Average path length 

Certainly, the most important property for an small-world network is a logarithmic 
average path length (APL) (with the number of nodes). It has obvious implications for 
the dynamics of processes taking place on networks. Therefore, its study has attracted 
much attention. Here APL means the minimum number of edges connecting a pair 
of nodes, averaged over all pairs of nodes. Below, using an approach similar to that 
presented in jJH], we will study the APL of our network for the particular case m = 2. 

We label each of the network nodes according to their creation times, v = 
1,2,3, ... ,N — 1,N. We denote L(N) as the APL of our network with order N. It 
follows that L(N) = J$=iT, where e(N) = £i 

<i<j<N^i,j is the total distance, where 
£ij is the smallest distance between node % and j. 

For this special case m — 2, any newly-created node is actually only attached to 
both ends of an edge. Thus the distances between existing node pairs will not be affected 
by the addition of new vertices. Then we have the following equation: 

N 

L(N + l) = L(N)+Y,kN+i (9) 
i=i 
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Figure 4. Semilogarithmic graph of the dependence of average path length on network 
order N in the case of m — 2 and q = 0.5. All values plotted are averages over ten 
independent realizations. The values can be fitted well by a straight line. 



Like in the analysis of |H3IIZ|, Eq. (jUJ) can be rewritten approximately as: 

L(N + 1) « L(N) + N + (N - 2)L(N - 1) (10) 
After some derivations, we can provide an upper bound for the variation of e(N) as 

which leads to 

e(N) = N 2 \nN + (3, (12) 

where (3 is a constant. As e(N) ~ iV 2 lniV, we have L(N) ~ In A/". Therefore, we have 
proved that in the special case of m = 2 of our model, there is an slow growth of the 
APL with the network size N. In Fig. (jlj), we present the APL vs the network order 
N in the case of m = 2 and q = 0.5. We see that the APL behaves logarithmically as 
a function of N. We expect that for other values of q, the APL will present a similar 
behavior. In fact, in the case of q — 1, we can compute exactly the diameter of the 
network (i.e. the maximum distance between all pairs of nodes). A sharp analytical 
proof shows that the diameter also grows logarithmically with the number of nodes of 
the network j32]- It should be noted that in our model, considering values of m greater 
than 2, then the APL will increase more slowly than in the case m = 2 as in those cases 
the larger m is, the denser the network becomes. 

Similar to Refs. W2\ , the interpretation for the slow growth of APL is as follows. 
The older nodes that had once been geographically proximal along the ring are pushed 
apart as new nodes are positioned in the interval between them. From Fig. ^] we can 
see that when new nodes enter into the network, the original nodes are not near but, 
rather, have many newer nodes inserted between them. Thus, the network growth 
creates "shortcuts" attached to old nodes, which join remote nodes along the ring one 
another as in the WS model [Sj. 
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3.4- Diameter for deterministic networks 

As we have mentioned above the diameter of a network is the maximum of the 
distances between all pairs of nodes, characterizing the longest communication delay 
in the network. Small diameter is consistent with the concept of small-world. In the 
deterministic case q — 1, we denote N(t) as N q= i(t) and Diam(N q= i(t)) as the diameter 
of N q= i(t) which can be computed exactly. But here we only give an upper bound on 
the diameter. The obtained bound scales logarithmically with the order of the networks. 
Now we present the main ideas of this analysis as follows. 

Clearly, at step t = 0, Diam(N q= i(0)) equals to 1. At each step t > 1, we call 
newly-created nodes at this step active nodes. Since all active nodes are attached 
to those nodes existing in N q= i(t — 1), so one can easily see that the maximum 
distance between arbitrary active node and those nodes in N q= \(t — 1) is not more 
than Diam(N q= i(t — 1)) + 1 and that the maximum distance between any pair of active 
nodes is at most Diam(N q= i(t — 1)) + 2. Thus, at any step, the diameter of the network 
increases by 2 at most. Then we get 2(t+l) as an upper bound of Diam(N(t). Note that 
the logarithm of N q= i(t) is ln((m+l)2*) = t ln2+ln(m+l), which is approximately equal 
to (t + 1) In 2 in the limit of large t. Thus the diameter grows at most logarithmically 
with the network order. Since our aim here is to show that the network diameter is 
small, so we only give a rough upper on diameter not more exact than that in [32] ■ 

4. Conclusion 

To sum up, we give here a simple evolving model for small-world networks. During 
the network growth, new nodes do not have a complete knowledge of all the current 
network nodes, but are attached to those preexisting sites that are geographically close 
to them. We have obtained both analytically and numerically the solution for relevant 
parameters of the network and we have verified that our model exhibits the classical 
characteristics of small-world network: a high clustering and a short APL. In addition, 
the model under consideration is actually a tunable generalization which includes as 
particular extreme cases the models introduced in Refs. [31] and [12! ■ Moreover, the 
networks can model a variety of real-life networks whose topologies are influenced by 
such geographical constraints. 
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